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Abstract 


The  impact  of  assimilating  Argo  data  into  initial  field  on  the  short-term 
forecasting  accuracy  of  temperature  and  salinity  is  quantitatively  estimated  by  using  a 
forecasting  system  of  the  western  North  Pacific,  on  the  base  of  the  Princeton  Ocean 
Model  with  generalized  coordinate  system  (POMgcs).  This  system  uses  a  sequential 
multi-grid  three-dimensional  variational  (3DVAR)  analysis  scheme  to  assimilate 
observation  data.  Two  numerical  experiments  were  conducted  with  and  without  Argo 
temperature  and  salinity  profile  data  besides  conventional  temperature  and  salinity 
profile  data  and  sea  surface  height  anomaly  (SSHa)  and  sea  surface  temperature  (SST) 
in  the  process  of  assimilating  data  into  initial  fields.  The  forecast  errors  are  estimated 
through  using  independent  temperature  and  salinity  profiles  during  the  forecasting 
period,  including  the  vertical  distributions  of  the  horizontally  averaged  root  mean 
square  errors  (H-RMSEs)  and  horizontal  distributions  of  the  vertically  averaged  mean 
errors  (MEs)  and  temporal  variation  of  spatially  averaged  root  mean  square  errors 
(S-RMSEs).  Comparison  between  the  two  experiments  shows  that  the  assimilation  of 
Argo  data  significantly  improves  the  forecast  accuracy,  with  24%  reduction  of 
H-RMSE  maximum  for  the  temperature,  and  the  salinity  forecasts  are  improved  more 
obviously,  averagely  dropping  of  50%  for  H-RMSEs  in  depth  shallower  than  300m. 
Such  improvement  is  caused  by  relatively  uniform  sampling  of  both  temperature  and 
salinity  from  the  Argo  drifters  in  time  and  space. 

Key  words:  Data  assimilation,  Argo  data,  Western  North  Pacific,  Ocean  prediction 
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1.  Introduction 


Data  assimilation,  required  in  operational  ocean  data  retrieval,  has  contributed 
significantly  to  the  success  of  ocean  prediction.  It  is  to  blend  modeled  variable  (xm) 
with  observational  data  (y0)  (Chu  et  al.,  2004;  Chu  et  al.,  2010), 

Xa=Xm+W«[j0-H(Xm)]  (1) 

where  va  is  the  assimilated  variable;  H  is  an  operator  that  provides  the  model’s 
theoretical  estimate  of  what  is  observed  at  the  observational  points,  and  W  is  the 
weight  matrix.  Difference  among  various  data  assimilation  schemes  such  as  optimal 
interpolation  (Chu  et  al.,  2007a;  Chu  et  al.,  2007b),  Kalman  filter  (Galanis  et  al., 
2011),  and  three-dimensional  variational  (3DVAR)  methods  (Li  et  al.,  2008)  is  the 
different  ways  to  determine  the  weight  matrix  W.  The  data  assimilation  process  (1) 
can  be  considered  as  the  average  (in  a  generalized  sense)  of  xm  and  y0.  The  two  parts 
(xm  and  y0 )  in  the  assimilation  process  usually  have  very  different  characteristics  in 
terms  of  data  temporal  and  spatial  distribution:  uniform  and  dense  in  the  modeled  data 
(xm),  and  non-uniform  and  sparse  in  the  observed  data  (y0)-  Question  arises:  What  is 
the  impact  of  data  sampling  strategies  in  the  assimilation  of  initial  field  on  the 
forecasting  accuracy?  To  answer  this  question,  two  observational  datasets  are  needed 
with  different  types  of  data  distribution  patterns  in  space  and  time.  One  is  relatively 
uniform,  and  the  other  is  not. 

The  Global  Temperature  and  Salinity  Profile  Program  (GTSPP),  as  a  cooperative 
international  project,  has  been  established  since  1990  to  provide  global  temperature  (T) 
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59  and  salinity  (S)  resources.  GTSPP  contains  conventional  temperature  and  salinity 

60  profile  data  such  as  Nansen  bottle,  conductivity-temperature-depth  (CTD),  and 

61  bathythermograph  (BT),  which  are  usually  collected  from  ships.  Since  the  Array  for 

62  Real-time  Geostrophic  Oceanography  (Argo)  is  launched  into  practice,  GTSPP  (T,  S) 

63  profiles  increase  rapidly  in  both  quantity  and  quality.  It  becomes  possible  to  monitor 

64  the  temporal  and  spatial  variations  of  temperature  and  salinity  simultaneously.  Liu  et 

65  al.  (2004)  showed  significant  improvement  of  temperature  prediction  in  the  central 

66  Pacific  using  a  global  ocean  model  with  Argo  data  assimilation.  Griffa  et  al.  (2006) 

67  analyzed  the  impact  of  Argo  data  assimilation  on  a  Mediterranean  prediction  model 

68  by  a  set  of  idealized  experiments,  and  discussed  the  impact  of  coverage  density  and 

69  locations  of  Argo  data  on  assimilation  results. 

70  Due  to  the  limitation  of  ship  time,  the  conventional  (T,  S)  profile  data  are 

71  non-uniformly  distributed  in  space  and  time.  However,  the  Argo  floats  drift  freely 

72  with  ocean  currents,  the  Argo  data  are  more  uniformly  distributed  in  space  and  time 

73  than  the  conventional  data.  Such  difference  in  data  distributions  between  the 

74  conventional  (non-uniform)  and  Argo  (relatively  uniform)  (T,  S)  profile  data  provides 

75  an  opportunity  to  study  the  effect  of  the  sampling  strategies  on  the  ocean  prediction 

76  accuracy.  To  do  so,  a  numerical  forecasting  system  with  3DVAR  in  the  western 

77  Pacific  regional  seas  (Fig.  1)  is  constructed  with  the  capability  to  assimilate  sea 

78  surface  height  anomaly  (SSHa)  from  altimeters  and  sea  surface  temperature  (SST) 

79  from  satellite  remote  sensors,  as  well  as  in-situ  conventional  and  Argo  (T,  S)  profiles 

80  in  the  determining  of  the  initial  conditions.  A  seven-day  forecast  is  conducted  with 
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and  without  the  assimilation  of  Argo  (T,  S)  profiles  in  initial  field.  The  prediction 
accuracy  is  verified  with  independent  temperature  and  salinity  profiles  during  the 
period  of  prediction  (not  used  in  the  data  assimilation  of  initial  field).  Difference 
between  the  two  forecast  experiments  shows  the  impact  of  data  distribution  on  the 
ocean  prediction  accuracy. 

Frame  of  the  paper  is  outlined  as  follows.  Section  2  shows  the  basic  features  of 
conventional  and  Argo  profile  data.  Section  3  describes  the  ocean  dynamic  model  and 
ocean  data  assimilation  scheme.  Section  4  gives  the  experiment  design  and  the 
quantitative  analysis  on  the  improvement  of  ocean  prediction  using  the  Argo  data 
assimilation.  Section  5  presents  the  conclusions. 

Figure  1 


2.  Data 

Ocean  observational  Data  (January-December  2008)  include  SSFIa  from 
multi- satellite  altimeters  and  SST  from  satellite  remote  sensors,  and  (T,  S)  profiles 
(conventional  and  Argo)  from  GTSPP.  The  satellite  SSFIa  and  SST  data  are  on  the 
horizontal  resolution  of  0.25°  and  the  time  increment  of  1  day.  Quality  control  is 
conducted  on  both  conventional  and  Argo  profile  data  before  assimilating  them  into 
the  initial  field  of  the  numerical  forecasting.  For  the  conventional  data,  it  includes 
position/time  check,  depth  duplication  check,  depth  inversion  check,  temperature  and 
salinity  range  check,  excessive  gradient  check,  and  stratification  stability  check.  For 


5 


101  the  Argo  floats,  it  includes  duplicate  float  test,  land  position  test,  float  drafting 

102  velocity  test,  pressure  range  test,  temperature  and  salinity  coherence  test,  pressure 

103  level  duplication  test  and  pressure  inversion  test,  spike  test,  salinity  and  temperature 

104  gradient  test,  and  stratification  stability  test,  etc.  In  addition,  the  calibration  method 

105  developed  by  Wong  et  al.  (2003)  is  employed  to  calibrate  the  sensor  drift  of  salinity 

1 06  measurements  in  the  Argo  data. 

107  Figure  2  shows  the  horizontal  distribution  of  (T,  S)  profile  data.  From  January  to 

108  December  2008,  there  are  60634  temperature  profiles  and  52638  salinity  profiles  from 

109  conventional  observations,  5323  temperature  profiles  and  5210  salinity  profiles  from 

110  Argo  floats.  That  is  to  say,  the  Argo  data  is  near  1/10  of  the  conventional  data.  The 

111  conventional  (T,  S)  profiles  are  distributed  non-uniformly  in  horizontal  with  most 

112  profiles  around  Japan  and  east  of  Taiwan  and  much  less  profiles  in  the  other  regions, 

113  and  existence  of  some  data-void  areas.  The  Argo  (T,  S)  profiles  are  distributed 

114  uniformly  (relative)  over  the  whole  area.  Figure  3  shows  the  vertical  distributions  of 

115  numbers  of  observations  for  temperature  and  salinity  from  conventional  and  Argo  data. 

116  The  conventional  temperature  (salinity)  observations  decrease  slowly  from  57597 

117  (48595)  data  points  near  the  surface  to  about  40000  (T  and  S)  data  points  at  near  700 

118  m  depth,  and  reduce  drastically  to  around  2000  (T  and  S)  data  points  below  700  m 

119  depth  (Fig.  3a).  The  Argo  temperature  (salinity)  observations  have  5299  (5186)  data 

120  points  from  near  surface  to  about  420  m  depth,  decrease  almost  linearly  to  2000  (T 

121  and  S)  data  points  at  about  1500  m  depth,  keep  2000  (T  and  S)  data  points  from  1500 

122  to  1800  m  depth,  and  reduce  to  less  than  100  data  points  at  2000  m  depth  (Fig.  3b). 
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Two  (T,  S)  datasets  are  used  to  investigate  the  impact  of  the  sampling  strategies 
on  the  ocean  prediction  accuracy.  The  first  dataset  (called  “WITHARGO”)  contains 
Argo  profile  data  besides  conventional  profiles,  SSHa  and  SST  and  represents 
horizontally  uniform  (relative)  sampling.  The  second  dataset  (called  “NO  ARGO”) 
contains  only  the  conventional  profile  data,  SSHa  and  SST  and  represents  horizontally 
non-uniform  sampling. 

Figures  2,  3 


3.  Ocean  Prediction  System 

3.1  Ocean  Model 

The  ocean  model  used  in  this  study  is  the  Princeton  Ocean  Model  with 
generalized  coordinate  system  (POMgcs).  The  study  domain  covers  from  99°E  to 
150°E  in  longitude,  and  from  10°N  to  52°N  in  latitude  (Fig.  1),  with  variable 
horizontal  resolution  starting  from  1/12°  near  the  coastal  waters  of  China  and 
Kuroshio ,  and  telescoping  to  1/2°  at  other  areas.  The  vertical  coordinate  is  a 
combination  of  sigma  and  z-level  with  a  maximum  depth  of  5035  m,  discretized  by  35 
model  levels.  In  the  vicinity  of  upper  mixed  layer  and  thermocline,  z-coordinate  is 
adopted  in  order  to  get  a  higher  vertical  resolution.  In  shallow  water  and  the  area  near 
bottom  boundary,  the  terrain-following  o-coordinate  is  used.  Sea  surface  forcing 
fields  consist  of  winds,  air  temperatures,  humidity  and  clouds  from  the  National 
Centers  for  Environmental  Prediction  (NCEP)  reanalysis.  Sea  surface  heat  fluxes  are 
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143  calculated  by  bulk  formula,  and  open  boundary  conditions  are  provided  by  the 

144  simulation  results  of  Massachusetts  Institute  of  Technology  general  circulation  model 

145  (MITgcm,  Marshall  et  al.,  1997),  including  daily  Sea  level,  temperature,  salinity,  and 

146  currents.  These  open  boundary  data  are  interpolated  to  the  grid  and  time  step  of  the 

147  forecasting  system. 

148  3.2  Ocean  Data  Assimilation  Scheme 

149  The  ocean  data  assimilation  scheme  used  in  the  system  is  a  sequential 

150  three-dimensional  variational  (3DVAR)  analysis  scheme  designed  to  assimilate 

151  temperature  and  salinity  using  a  multi-grid  framework  (Li  et  al.,  2008).  This 

152  sequential  3DVAR  analysis  scheme  can  be  performed  in  three  dimensional  spaces  and 

153  can  retrieve  resolvable  information  from  longer  to  shorter  wavelengths  for  a  given 

154  observation  network  and  yield  multi-scale  analysis.  The  basic  idea  of  this  data 

155  assimilation  scheme  can  be  referred  to  Li  et  al.  (2008)  and  Li  et  al.  (2010). 

156  The  data  assimilation  is  carried  out  in  the  upper  1000m.  The  basic  idea  proposed 

157  by  Troccoli  et  al.  (2002)  is  employed  to  make  salinity  adjustment  for  the  background 

158  field  after  temperature  data  is  assimilated.  The  area  extent  of  adjustment  is  limited 

159  between  the  latitude  of  30°S-30°N  and  depth  of  50- 1000m.  It  needs  firstly  to  establish 

160  a  T-S  relationship  by  using  interpolation  algorithm  based  on  the  instant  model  T-S 

161  table.  Then  the  background  field  of  salinity  is  adjusted  based  on  the  T-S  relationship 

162  and  temperature  analysis  result.  In  addition,  an  idea  of  converting  satellite  altimeter 

163  SSHa  into  T-S  “pseudo  profiles”  based  on  the  3DVAR  scheme  is  adapted  ((Zhu  and 
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Figure  4  shows  the  flow  chart  for  data  assimilation  procedure:  (1)  Based  on  24-h 
forecasting  (T,  S)  values,  obtain  the  T-S  relationship  at  every  grid  point  through  using 
the  T-S  relationship  module;  (2)  Convert  altimeter  SSHa  into  “pseudo  profiles”  of 
temperature  and  salinity;  (3)  Assimilate  temperature  data  to  obtain  temperature 
analysis  field;  (4)  Adjust  24-h  forecasting  salinity  field  on  the  base  of  the  T-S 
relationship  and  temperature  analysis  result,  and  take  the  adjusted  salinity  field  as  the 
background  field  for  salinity  assimilation;  (5)  Assimilate  salinity  data  to  obtain 
salinity  analysis  field;  (6)  the  temperature  and  salinity  analysis  fields  are  used  as  the 
initial  conditions  of  next  seven-day  forecast. 

Figure  4 

3.3.  Experiment  Design 

Two  forecast  experiments  are  designed.  The  first  experiment  (called 
“NO  ARGO”)  assimilates  all  available  observations  (conventional  T,  S  profiles  and 
SSHa  and  SST)  except  the  Argo  profile  data.  The  second  experiment  (called 
“WITH_ARGO”)  assimilates  all  available  observations  including  the  Argo  profile 
data.  Both  experiments  use  the  same  sea-surface  forcing  fields  and  open  boundary 
conditions.  The  China  Ocean  ReAnalysis  (CORA)  fields  of  January  1,  2008  (Han  et 
al.,  2011,  http://www.cora.net.cn)  are  used  as  initial  conditions.  First,  a  seven-day 
forecast  is  performed  for  both  experiments.  Second,  the  data  assimilation  is  performed 
using  24-hour  forecast  values  as  the  background  field.  Taking  the  assimilated  fields  as 
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185  initial  conditions,  the  next  seven-day  forecast  is  performed.  This  procedure 

186  (forecast-assimilation-forecast)  is  cycled  365  times  to  obtain  24-hour,  48-hour, 

187  72-hour,  96-hour,  120-hour,  144-hour,  168-hour  forecast  values  of  temperature  and 

188  salinity  fields  in  every  day  of  2008.  The  time  window  of  assimilating  SST  and  SSHa 

1 89  data  in  both  experiments  is  set  to  one  day,  namely  assimilating  satellite  data  within  the 

190  one  day  before  initial  forecasting  time.  Since  the  spatial  distributions  of  conventional 

191  observations  and  Argo  data  are  sparse,  both  experiments  adopt  the  3. 5 -day  time 

192  window,  namely  assimilating  ocean  (7,  S)  profile  data  within  the  3.5  days  before 

193  initial  forecasting  time.  Since  all  temperature  and  salinity  observational  data  during 

194  the  period  of  forecasting  are  not  assimilated  into  background  fields  (initial  field  of  the 

195  numerical  forecasting),  they  are  taken  as  independent  data  to  be  used  to  check  the 

196  forecast  result.  Based  on  these  independent  observation  data,  the  errors  of  the  24-hour, 

197  48-hour,  72-hour,  96-hour,  120-hour,  144-hour,  and  168-hour  forecast  values  of  the 

198  temperature  and  salinity  at  each  grid  point  in  every  day  of  2008  can  be  estimated.  The 

199  vertical  distributions  of  forecast  errors  are  obtained  by  averaging  the  errors  in  the 

200  horizontal  direction.  The  horizontal  distributions  of  forecast  errors  are  obtained  by 

201  averaging  the  errors  in  the  vertical  direction.  Difference  of  forecast  errors  between  the 

202  two  experiments  shows  the  effect  of  sampling  strategies  on  the  ocean  prediction 

203  accuracy. 
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4.  Effect  of  Argo  Data 


4.1  Whole  3D  Domain 


To  quantify  the  impact  of  assimilating  Argo  data  on  an  ocean  prediction  errors, 
the  horizontally  averaged  root  mean  square  error  (H-RMSE)  between  predicted  and 
observed  values  for  the  whole  horizontal  region  at  depth  z*  and  time  tm  is  calculated 
by 


H-  RMSE^  (zk ,  tm  )  =  P-  Yj  \yP  (xn » y„  >  zk » tm  )  -  Y°  (*„ ,  y„ ,  h ,  tm  )]2 

V  N  n= 1 


(2) 


where  xn  and  y„  indicate  the  zonal  and  latitudinal  coordinates  of  the  nth  observation 
point,  respectively;  z*  is  the  depth  of  the  /cth  level;  is  the  /77th  forecasting  time;  N  is 
total  number  of  observation  points  at  the  /,„  time  and  depth;  y/p  (xn  ,yn,zk,tm  )  and 
i//"  (.xn ,  yn ,  zk ,  tm )  respectively  denote  the  predicted  and  ground-truth  values  at  the  tm 
time  and  z/f  depth  for  the  point  (xn,yn).  In  the  study,  indicates  temperature  (7)  or 
salinity  (S).  H-RMSE^^,/,)  can  be  used  to  evaluate  the  overall  performance  for 
the  whole  depths. 

Figure  5  a  and  b  show  the  vertical  distribution  of  H-RMSEs1'  for  /|=24-hour  and 
t2=  168-hour  forecasts  with  and  without  Argo  profiles  assimilation.  Since  the  high 
resolution  and  horizontally  uniform  satellite  remote  sensing  SST  data  are  assimilated, 
inclusion  of  Argo  data  does  not  improve  the  accuracy  of  SST  prediction. 

H-RMSEs1  ri  at  time  t\  and  increase  with  depth  from  the  surface  to  its 
maximum  value  at  around  158  m  depth,  where  is  the  mean  thermocline  location, 
reduce  drastically  to  0.5  °C  at  around  1000  m  depth,  and  reduce  gradually  to  0.25  °C 
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to  2000  m  depth.  The  low  value  of  H-RMSE(r)  below  1000  m  depth  for  all  cases  may 
be  caused  by  the  low  variability. 

For  24-hour  forecast  (Fig.  5a),  the  maximum  value  of  H-RMSE(r)  is  2.1  °C 
without  Argo  data  assimilation  and  1.6°C  with  Argo  data  assimilation  (24%  error 
reduction).  The  improvement  of  ocean  prediction  is  very  evident  until  1000  m  depth. 
Since  the  value  of  H-RMSE(r)  below  1000  m  depth  is  already  small  (0.25-0.5  °C),  the 
improvement  with  the  Argo  data  is  not  noticeable.  Such  improvement  in  upper  1000 
m  especially  at  around  158  m  depth  is  still  evident  in  168-hour  forecast  (Fig.  5b). 

Figure  5 

Figure  5  c  and  d  show  the  vertical  distribution  of  H-RMSEs(S)  for  6=24-hour  and 
t2=  168-hour  forecasts  with  and  without  Argo  profile  data  assimilation.  Similar  to  the 
temperature  prediction,  the  H-RMSE  of  salinity  for  all  cases  reduces  evidently  from 
the  surface  to  depth  around  1200  m,  and  reduces  gradually  below  1200  m.  The  low 
value  of  H-RMSE(5  below  1200  m  depth  is  related  to  the  low  variability.  Without 
Argo  data  assimilation,  H-RMSEs<  Vl  at  time  t\  and  ti  are  very  large,  with  more  than 
0.5  psu  for  depths  shallower  than  300  m.  With  Argo  data  assimilation,  they  decrease 
drastically  to  less  than  0.23  psu  for  24-hours  forecast  and  0.25  psu  for  168-hour 
forecast  with  error  reduction  more  than  50%.  Below  1200  m  depth,  H-RMSEs'S)  at 
time  t\  and  h  are  quite  small  with  slightly  larger  values  in  “WITH  ARGO” 
experiment  than  in  the  “NO_ARGO”  experiment.  This  may  be  related  that  the  depth 
of  assimilating  date  is  limited  to  upper  1000m.  A  further  study  is  needed  to  explain 
such  phenomena. 
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4.2  Near  Thermocline 
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The  mean  errors  (ME)  within  the  layers  between  z«  and  z*2  at  time  tm  is 
calculated  using  Eq.(3)  to  identify  the  forecast  system  performance. 


Kt~L 


(3) 


Where  all  letters  express  the  same  means  as  ones  in  the  Eq.(2)  and  k\ ,  represents 
the  £ith  and  feth  level,  respectively;  K  equals  to  k\-k2.  Here,  to  evaluate  the  forecast 
performance  near  the  mean  thermocline,  the  depths  of  the  £ith  and  feth  level  are  100m 
and  300m,  respectively,  and  the  tm  is  24-hour. 

Figure  6  a  and  b  show  the  horizontal  distributions  of  the  vertically  (100-300  m) 
averaged  temperature  mean  errors  in  24-hour  forecast  without  and  with  Agro  data 
assimilation,  respectively.  Without  Agro  data  assimilation,  the  predicted  temperatures 
are  lower  than  observations  in  most  areas.  In  the  east  areas  of  Japan,  the  predicted 
temperatures  are  0.8  °C  higher  than  observations.  With  Argo  data  assimilation,  the 
predicted  temperatures  are  significantly  improved,  and  the  forecast  errors  are  0.1°C 
or  less  in  the  whole  areas.  Therefore,  the  assimilation  of  Argo  data  can  reduce  errors 
of  temperature  forecast  dramatically  near  the  mean  thermocline. 

Figure  6 

Figure  6  c  and  d  show  the  horizontal  distributions  of  the  vertically  (100-300  m) 
averaged  salinity  mean  errors  in  24-hour  forecast  without  and  with  Agro  data 
assimilation,  respectively.  Without  Agro  data  assimilation,  the  predicted  salinity  is 
significantly  lower  than  observations  in  most  areas.  For  example,  the  predicted 
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salinity  is  over  0.5  psu  lower  than  observation  in  the  area  of  15°N-35°N.  However,  the 


predicted  salinity  is  significantly  higher  than  observation  in  the  small  east  area  of 
Japan.  It  indicates  that  an  obvious  bias  exits  for  salinity  forecast  without  Argo  data 
assimilation.  With  Argo  data  assimilation,  the  predicted  salinity  is  significantly 
improved,  and  the  forecast  errors  are  0.2  psu  or  less  in  the  whole  areas.  Therefore,  the 
assimilation  of  Argo  data  can  reduce  errors  of  salinity  forecast  dramatically  near  the 
mean  halocline. 


4.3  Error  Evolution 


The  spatially  averaged  root  mean  square  error  (S-RMSE)  between  predicted  and 
observed  values  for  the  whole  horizontal  region  within  the  layers  between  Zk\  and  zta 
and  at  time 


S-RMSE^HJ: 


I  ft2  A  2 

f— ^^Y\yp^y^zM-w\xn,yn,zk,tm)\ 

VVA  k=ki  n=l 


(4) 


is  also  used  for  the  evaluation.  Just  as  Eq.(3),  all  letters  in  the  Eq.(4)  express  the  same 
means  as  ones  in  the  Eq.(2). 

The  S-RMSEs  of  temperature  are  calculated  using  Eq.(4)  for  upper  (0-50m)  and 
lower  (50-1 000m)  layers  to  analysis  the  errors  growth  (Fig.  7).  The  S-RMSEs^  are 
generally  lager  and  grow  faster  in  the  upper  layer  than  in  the  lower  layer.  For  the 
upper  layer,  without  Argo  data  assimilation,  the  S-RMSE(7^  is  1.33  °C  for  24-hour 
forecast,  and  1.51  °C  for  168-hour  forecast  (14%  increasing).  With  Argo  data 
assimilation,  the  S-RMSE(r)  is  1.26°C  for  24-hour  forecast,  and  1.49°C  for  168-hour 
forecast  (18%  increasing).  For  the  lower  layer,  without  Argo  data  assimilation,  the 
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S-RMSE(r)  is  1.15°C  for  24-hour  forecast,  and  1.18°C  for  168-hour  forecast  (3% 
increasing).  With  Argo  data  assimilation,  the  S-RMSEm  is  0.93  °C  for  24-hour 
forecast,  and  1.03°C  for  168-hour  forecast  (11%  increasing). 

With  Argo  data  assimilation,  the  accuracy  of  temperature  forecasts  is 
significantly  improved.  However,  it  is  worthy  note  that  the  forecast  errors  in  the 
“WITHARGO”  experiment  grow  a  little  faster  compared  to  those  in  the 
“NOARGO”  experiment.  This  is  because  the  assimilation  of  Agro  data  just  improves 
the  accuracy  of  initial  conditions  and  can  not  correct  the  model  systematic  bias.  As  a 
result,  the  forecast  error  around  initial  forecast  time  in  the  “WITH  ARGO” 
experiment  is  mainly  determined  by  the  accuracy  of  initial  conditions  and  much  lower 
than  ones  in  the  “NO  ARGO”  experiment,  and  with  the  increase  of  the  forecast  time, 
the  forecast  error  is  mainly  affected  by  model  systematic  bias  so  that  the  forecast  error 
with  assimilation  of  Argo  data  increases  sharply. 

Figure  7,  8 

Same  as  the  temperature,  the  S-RMSEs  of  salinity  are  calculated  using  Eq.(4)  for 
upper  (0-300m)  and  lower  (300-1 000m)  layers  to  identify  the  errors  growth  (Fig.  8). 
S-RMSEs<Sl  are  generally  lager  in  the  upper  layer  than  in  the  lower  layer.  For  the 
upper  layer,  without  Argo  data  assimilation,  the  S-RMSE<5)  is  near  0.5  psu  for  the 
whole  prediction  period.  With  Argo  data  assimilation,  the  S-RMSEI,S)  is  0.17  psu  for 
24-hour  forecast,  and  0.22  psu  for  168-hour  forecast,  much  less  than  50%  of  that 
without  Argo  data  assimilation.  For  the  lower  layer,  without  Argo  data  assimilation, 
the  S-RMSE(S)  is  near  0.15  psu  for  the  whole  prediction  period.  With  Argo  data 
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assimilation,  the  S-RMSEs®  are  0.07  psu  and  0.09  psu  for  72-hour  and  longer 
forecast,  and  the  S-RMSEs'S)  reduce  around  40%  relative  to  that  without  Argo  data 
assimilation.  So,  with  Argo  data  assimilation,  the  accuracy  of  salinity  forecasts  is 
significantly  improved. 

Figures  9 

4.4  Vertical  Cross  Sections 

A  set  of  CTD  temperature  measurements  (not  being  used  in  the  data  assimilation) 
is  used  for  the  evaluation.  It  was  conducted  on  23  February  2008  along  129°E  south 
of  Japan.  Figure  9a  gives  the  distribution  of  observational  temperatures  for  the  129°E 
cross-section,  while  Fig.  9b  and  c  show  results  of  24-hour  forecast  for  both 
experiments.  Temperature  field  with  Argo  data  assimilation  is  closer  to  observations 
than  that  without  Argo  data  assimilation. 

The  section  along  38.5°E  east  of  Japan  during  8  May  2008  is  used  for  illustration. 
Figure  10a  gives  the  distribution  of  observational  salinity,  while  Fig.  10b  and  c  show 
results  of  24-hour  forecast  for  both  experiments.  Just  as  temperature  section,  salinity 
field  with  Argo  data  assimilation  is  closer  to  observations  than  that  without  Argo  data 
assimilation. 

5.  Conclusion 

A  forecast  system  based  on  the  Princeton  Ocean  Model  with  generalized 
coordinate  system  (POMgcs)  and  sequential  multi-grid  3DVAR  analysis  scheme  is 
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developed  for  the  western  Pacific  marginal  seas  to  investigate  the  impact  of  sampling 


strategies  on  the  ocean  prediction  through  using  two  (T,  S)  profile  datasets.  The  first 
dataset  contains  both  conventional  and  Argo  profile  data  (called  “WITH  ARGO”) 
and  represents  horizontally  uniform  (relative)  sampling.  The  second  dataset  contains 
only  the  conventional  profile  data  (called  “NO  ARGO”)  and  represents  horizontally 
non-uniform  sampling. 

Without  Argo  data  assimilation  (i.e.,  non-uniform  sampling),  temperature  and 
salinity  forecast  have  obvious  biases.  Especially  in  the  area  of  15°N-35°N  the 
predicted  temperature  and  salinity  are  obviously  smaller  than  observations.  With  Argo 
data  assimilation,  these  biases  are  corrected.  Based  on  the  detailed  comparison  of 
horizontally  averaged  root  mean  square  error  (H-RMES)  between  the  two 
experiments,  it  is  known  that  the  temperature  H-RMSE  maximum  drops  by  24%  and 
the  salinity  H-RMSEs  in  depth  shallower  than  300m  drop  averagely  by  50%  if  the 
Argo  data  is  assimilated  into  initial  fields,  and  the  accuracy  of  salinity  forecast  is 
improved  more  obviously  than  temperature  forecast.  With  Argo  data  assimilation,  the 
temperature  or  salinity  distribution  along  some  vertical  cross  sections  is  nearer  to 
observations  than  that  without  Argo  data  assimilation.  It  indicates  that  the  assimilation 
of  Argo  data  plays  an  important  role  in  the  process  of  constructing  initial  fields,  and  it 
can  significantly  improves  the  temperature  and  salinity  forecasts.  It  is  worthy  note  that 
although  the  forecast  errors  within  assimilation  depth  (shallower  than  1 000m)  can  be 
sharply  reduced  though  assimilating  Argo  data  into  initial  filed,  the  errors  below 
1000m  depth  change  very  small,  or  even  can  slightly  increase.  A  further  study  is 
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353  needed  to  explain  such  phenomena. 
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Figure 


Fig.  1.  Geography  of  the  Western  North  Pacific.  The  dots  indicate  the  numerical  grid 
points. 

Fig.  2.  Spatial  distribution  of  temperature  (a)  and  salinity  (b)  profiles  from  GTSPP 

during  Jan-Dee  2008  (Red  dot:  conventional  data;  Blue  dot:  Argo  data). 

Fig.  3.  Vertical  distributions  of  numbers  of  observations  for  temperature  (red)  and 

salinity  (blue)  from  conventional  (a)  and  Argo  data  (b). 

Fig.  4.  Flow  chart  of  multi-grid  3DVAR  operational  procedure. 

Fig.  5.  Vertical  dependence  of  temperature  (a,  b,  )  and  salinity  (c,  d,  psu)  H-RMSEs 

in  24-hour  forecast  (a,  c)  and  168-hour  forecast  (b,  d)  with  and  without  Argo  data 

assimilation. 

Fig.  6.  Horizontal  distribution  of  vertically  (100-300  m)  averaged  temperature  (a,  b, 
°C)  and  salinity  (c,  d,  psu)  prediction  errors  in  24-hour  forecast  without  Argo  profiles 
assimilation(a,  c)  and  with  Argo  profiles  assimilation(b,  d). 

Fig.  7.  Temporal  variation  of  temperature  S-RMSEs  (°C)  for  the  layers  of  0-50m(a) 
and  50-1000m(b)  in  24-hour  forecast  with  and  without  Argo  data  assimilation. 

Fig.  8.  Temporal  variation  of  salinity  S-RMSEs  (psu)  for  the  layers  of  0-300m(a)  and 
300-1000m(b)  in  24-hour  forecast  with  and  without  Argo  data  assimilation. 

Fig.  9.  Vertical  temperature  cross-section  along  129°E  south  of  Japan  on  23  February 
2008:  (a)  observation  (dark  dots:  stations),  (b)  24-hour  forecast  without  assimilating 
Argo  profiles,  and  (c)  24-hour  forecast  with  assimilating  Argo  profiles. 

Fig.  10.  Vertical  salinity  cross-section  along  38.5°N  east  of  Japan  on  8  May  2008:  (a) 
observation  (dark  dots:  stations),  (b)  24-hour  forecast  without  assimilating  Argo 
profiles,  and  (c)  24-hour  forecast  with  assimilating  Argo  profiles. 
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Fig.  1.  Geography  of  the  Western  North  Pacific.  The  dots  indicate  the  numerical  grid 
points. 
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Fig.  2.  Spatial  distribution  of  temperature  (a)  and  salinity  (b)  profiles  from  GTSPP 
during  Jan-Dee  2008  (Red  dot:  conventional  data;  Blue  dot:  Argo  data). 
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Fig.  3.  Vertical  distributions  of  numbers  of  observations  for  temperature  (red)  and 
salinity  (blue)  from  conventional  (a)  and  Argo  data  (b). 
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Fig.  4.  Flow  chart  of  multi-grid  3DVAR  operational  procedure. 
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502  RMS  error 

503  Fig.  5.  Vertical  dependence  of  temperature  (a,  b,  )  and  salinity  (c,  d,  psu)  H-RMSEs 

504  in  24-hour  forecast  (a,  c)  and  168-hour  forecast  (b,  d)  with  and  without  Argo  data 

505  assimilation. 
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Fig.  6.  Horizontal  distribution  of  vertically  (100-300  m)  averaged  temperature  (a,  b, 
°C)  and  salinity  (c,  d,  psu)  prediction  errors  in  24-hour  forecast  without  Argo  profiles 
assimilation(a,  c)  and  with  Argo  profiles  assimilation(b,  d). 
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Fig.  7.  Temporal  variation  of  temperature  S-RMSEs  (°C)  for  the  layers  of  0-50m(a) 


and  50-1000m(b)  in  24-hour  forecast  with  and  without  Argo  data  assimilation. 
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Fig.  8.  Temporal  variation  of  salinity  S-RMSEs  (psu)  for  the  layers  of  0-300m(a)  and 
300-1000m(b)  in  24-hour  forecast  with  and  without  Argo  data  assimilation. 
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Fig.  9.  Vertical  temperature  cross-section  along  129°E  south  of  Japan  on  23  February 
2008:  (a)  observation  (dark  dots:  stations),  (b)  24-hour  forecast  without  assimilating 
Argo  profiles,  and  (c)  24-hour  forecast  with  assimilating  Argo  profiles. 
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571  Fig.  10.  Vertical  salinity  cross-section  along  38.5°N  east  of  Japan  on  8  May  2008:  (a) 

572  observation  (dark  dots:  stations),  (b)  24-hour  forecast  without  assimilating  Argo 

573  profiles,  and  (c)  24-hour  forecast  with  assimilating  Argo  profiles. 
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